Àá½Ã¸¸ ±â´Ù·Á ÁÖ¼¼¿ä. ·ÎµùÁßÀÔ´Ï´Ù.
KMID : 0381120150370060525
Genes and Genomics
2015 Volume.37 No. 6 p.525 ~ p.535
Effects of omics data combinations on in silico tumor-normal tissue classification
Seok Ho-Sik

Seok Seung-Hwan
Kim Jae-Bum
Abstract
A disease can be characterized by various attributes such as genomic, epigenetic, and transcriptomic features beyond physiological symptoms. The accumulation of vast datasets allows us to investigate the relative effectiveness of each omics data and their combinations for in silico analysis of diseases. Here, we employed a classification method with the well-established measure of information gain for the computational analysis of the effect of the aggregation of omics data, especially for the task of in silico classification of tumor-normal samples for bladder urothelial carcinoma and kidney renal papillary cell carcinoma. We observed that the combination of multi-omics data such as copy number variation, DNA methylation, RNA-Seq, and somatic mutations have beneficial effects. The quantitative analysis using information gain and various measures for classification-performance showed that the combination of multiple omics data improved the performance in general. The qualitative analysis referring previous researches also confirmed the relevance of genes with higher information gain to target diseases. Our results report that the combination of multiple omics data is beneficial and the information gain which focuses on the distribution of attributes across target domains could be useful as an indicator of the effect of each omics data on tumor-normal sample classification.
KEYWORD
Multiomics data, Copy number variation, DNA methylation, RNA-Seq, Somatic mutations, The cancer genome atlas (TCGA)
FullTexts / Linksout information
Listed journal information
SCI(E) ÇмúÁøÈïÀç´Ü(KCI)